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Abstract 



The top eigenvalues of rank r spiked real Wishart matrices and additively perturbed 
Gaussian orthogonal ensembles are known to exhibit a phase transition in the large size 
limit. We show that they have limiting distributions for near-critical perturbations, 
fully resolving the conjecture of Baik, Ben Arous and Peche (2005). The starting point 
is a new (2r + l)-diagonal form that is algebraically natural to the problem; for both 
models it converges to a certain random Schrodinger operator on the half-line with 
r X r matrix-valued potential. The perturbation determines the boundary condition, 
and the low-lying eigenvalues describe the limit jointly over all perturbations in a fixed 
subspace. We treat the real, complex and quaternion {(3 = 1, 2, 4) cases simultaneously. 
We also characterize the limit laws in terms of a diffusion related to Dyson's Brownian 
motion, and further in terms of a linear parabolic PDE; here /3 is simply a parameter. 
At /? = 2 the PDE appears to reconcile with known Painleve formulas for these r- 
parameter deformations of the GUE Tracy- Widom law. 
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1 Introduction 



This article is a sequel to Bloemendal and Virag (2010), which we cite throughout as Part I. 
We refer the reader to the introduction of that paper for background on the spiked Wishart 
model introduced by Johnstone (2001) in the context of high-dimensional data analysis, 
especially the phase transition phenomenon that Baik, Ben Arous and Peche (2005) described 
in the complex case. We refer the reader to the same place for background on the stochastic 
Airy operator introduced by Sutton (2005), Edelman and Sutton (2007), and its development 
by Ramirez, Rider and Virag (2011) that we build on. In this paper we cite Baik et al. as 
BBP and Ramirez et al. as RRV. 

In Part I we considered rank one spiked real/complex/quaternion Wishart matrices and 
additive rank one perturbations of the Gaussian orthogonal, unitary and symplectic ensem- 
bles. We introduced their general /3 analogues in terms of "spiked" versions of the tridiagonal 
ensembles of Dumitriu and Edelman (2002) and extended the RRV technology to describe 
the soft-edge scaling limit in terms of the stochastic Airy operator 

on L^(M_|_) with a boundary condition depending on the spike. The boundary condition 
changes from Dirichlet /(O) = to Neumann/Robin /'(O) = wf{0) at the onset of the 
BBP phase transition, with w G M representing a scaling parameter for perturbations in 
a "critical window" . The resulting largest eigenvalue laws form a one-parameter family 
of deformations of Tracy- Widom(/3), naturally generalizing the characterization of RRV in 
terms of the ground state of this random Schrodinger operator. 

We went on to characterize the limit laws in terms of the diffusion from RRV and further 
in terms of an associated second-order linear parabolic PDE. Finally, we showed that at 
/3 = 2,4 the PDE was related to known Painleve II representations originating in Baik and 
Rains (2000) and gave new proofs of these, recovering those of the corresponding undeformed 
Tracy- Widom laws. 

Even the existence of limiting distributions in the critical regime was in general new for 
/3 7^ 2, though see the prior work of Wang (2008) on the rank one /3 = 4 case at w = 0, as 
well as the subsequent work of Mo (2011) offering a more standard treatment of the rank 
one P = 1 case. Forrester (2011) comments on all three works and gives an alternative 
interpretation and construction of our general /3 rank one spiked model. 

Here we deal with r "spikes", or general bounded-rank perturbations of Gaussian and 
Wishart matrices. To do so we introduce a new "canonical form for perturbations in a 
fixed subspace", a (2r + l)-diagonal band form that has a purely algebraic interpretation. 
It generalizes the Dimitriu-Edelman forms and is able to handle rank r perturbations. We 
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then develop a generalization of the methods of RRV and Part I to a matrix- valued setting: 
block tridiagonal matrices converge to a half-line Schrodinger operator with matrix-valued 
potential, the spikes once again appearing in the boundary condition. We treat the real, 
complex and quaternion (/3 = 1,2,4) cases simultaneously. Once again, even the existence 
of a near-critical soft-edge limit is new off /3 = 2. Unlike in Part 1, however, we do not define 
a general (3 version of either matrix model, nor of the limiting operator; we will see that in 
the higher rank cases these objects do not readily admit a /3-generalization. 

Dyson's Brownian motion makes a surprise appearance, providing nice SDE and PDE 
characterizations of the limit laws — r parameter deformations of Tracy- Widom(/3) — in which 
P reappears as a simple parameter. The derivation makes use of the matrix versions of 
classical Sturm oscillation theory and the Riccati transformation. In a short final section we 
report on evidence that at /3 = 2 the PDE can be connected with a Painleve II representation 
of Baik (2006) for these distributions that appeared originally in EBP in the form of Fredholm 
determinants. 

We highlight two more features of our approach beyond the novelty of bypassing joint 
densities and handling /3 = 1,2,4 together. First, we treat the perturbation as a parameter. 
By this we mean that all perturbations in a fixed subspace are considered jointly (on the 
same probability space); this picture is carried through to the limit, which is therefore a 
family of point processes parametrized by an r x r matrix. Second, we allow more general 
scalings than those considered in BBP, Most importantly, in the Wishart case we do not 
require the two dimensional parameters n,p to have a positive limiting ratio but rather allow 
them to tend to infinity together arbitrarily. 

To state our results we introduce some objects and notation that will be used throughout 
the paper. 

Let F = M, C, or H and /3 = 1, 2 or 4, respectively. A standard F Gaussian Z ~ 
FA^(0, 1) is an F-valued random variable described in terms of independent real Gaussians 
gi,...,gfs^ iV(0, 1) as gi for F = M, (^i + ^2«)/V2 for F = C, and {gi+ g2i + 93j + 94k)/2 for 
F = H. Note that in each case E \Z\^ = 1 and uZ ~ FA^(0, 1) for m e F with \uf = u*u = 1. 

The space of column vectors F" is endowed with the standard inner product u^v and 
associated norm = m^m (we reserve double bars for function spaces). Write FA^„(0, /) for 
a vector of independent standard F Gaussians. With E G M„(F) positive definite, we write 
Z ~ ¥NniO, S) for Z = Y}I'^Zq with Zq ~ FAr„(0, /). 

Define the unitary group ?7„(F) = {U E F"^" : WU = I}, better known as the 
orthogonal, unitary or symplectic group for F = M, C, EI respectively. It acts on F" by left 
multiplication, on which the distribution FA'"„(0, /) is invariant. Write M„(F) = {A G F"^" : 

= A} for the self-adjoint matrices, also known as real symmetric, complex hermitian 
or quaternion self-dual. Un(¥) acts on M„(F) by conjugation. 
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The Gaussian orthogonal/unitary/symplectic ensemble (GO/U/SE) is tlie prob- 
ability measure on Mn(F) described hy A = (X + X'^)/\^ where X is an n x n matrix 
of independent FA^(0, 1) entries. The distribution is invariant under the unitary action. 
Furthermore, the algebraically independent entries Aij, i > j are statistically independent. 
(Together, this invariance and independence characterizes the distribution up to a scale fac- 
torr.) For an entry-wise description, the diagonal entries are distributed as N{0,2//3) while 
the off-diagonal entries are FA^(0, 1). 

Fixing a positive integer r, we study rank r additive perturbations A = Aq + P of 
a GO/U/SE matrix Aq, where P = P © 0„_,. with P e Mr(¥) nonrandom. We will be 
interested in the eigenvalues Ai > ■ ■ ■ > A„ of A. Of course for a single P their distribution 
depends only on the eigenvalues of P, but we consider them jointly over all P. 

We also consider real/complex/quaternion Wishart matrices. These are random 
nonnegative matrices in Mp(F) given by XX^ where the data matrix X is p x n with 
n independent FA'p(0, S) columns. We speak of a p-variate Wishart with n degrees of 
freedom and p x p covariance S > 0. Since we are interested in the nonzero eigenvalues 
Ai > ■ ■ ■ > AnAp, we can equally well consider X'^X. The distribution of X'^X may also be 
described as XqEXq where Xq is a. p x n matrix of independent FiV(0, 1) entries. The case 
S = / is referred to as the null case. We study the rank r spiked case where S = T,(Blp-r 
with S G Mr{¥) nonrandom. Once again the eigenvalue distribution depends only on the 
eigenvalues of S, but we consider the spectrum jointly as S varies. 

Our starting point is a new banded or multidiagonal form introduced in Section 2, ideally 
suited to the types of perturbations we consider. It is defined for almost every matrix 
A G M„(F); given vectors Vi, . . . ,Vr, the new basis may be obtained by applying the Gram- 
Schmidt procedure to the first n vectors of the sequence 

Vi, . . . , Vr, Avi, . . . , AVr, A'^Vi, . . . , A^Vr, .... 

The result is a (2r + l)-diagonal matrix with positive outer diagonals. For Gaussian and 
null Wishart ensembles, the change of basis interacts well with the Gaussian structure; this 
observation goes back to Trotter (1984) in the r = 1 case. In the GO/U/SE case vi, . . . ,Vr 
are simply the initial coordinate basis vectors, while in the Wishart case they represent the 
first n rows of the data matrix X. As in Part I, the key observation is that the perturbations 
commute with the change of basis. 
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For the (unperturbed) Gaussian ensembles, the band form looks like 

'g 9* ■■■ 9* X 
9 9 9* ■■■ 9* X 

■■9 9 9* ■■■ 9* X 

g \ 9 ■■• ■■• ■■• 

X 9 '■■ 

X 9 

X '■■ 

where the entries are independent random variables up to the f-symmetry with g ~ A^(0, 2//3), 
g ~ FA^(0, 1), and x ~ ;^Chi((n — r — k)f3), k = 0,1,2, .. . going down the matrix. (Recall 
that if Z ~ ]RA^m(0, /) then \Z\ ~ Chi(m).) For the null Wishart ensemble, the form is best 
described as follows. One first obtains a lower (r + l)-diagonal form for the data matix X 
whose nonzero singular values are the same as those of X. It looks like 

X 

9 X 

'■ 9 X 

9 ■■ 9 '■■ 

X 9 '■ ■■• 

X 9 

X ■■■ 

where the entries are independent random variables with g ~ FA^(0, 1), x ~ :^Chi((n — A;)/3) 
and X ~ :^Chi((?T, — r — k)(3), k = 0, 1, 2, . . . going down the matrix. One then forms its 
multiplicative symmetrization, a (2r + l)-diagonal matrix with the same nonzero eigenvalues 
as X. In both cases the perturbations appear in the upper-left r x r block. Section 2 provides 
derivations. The obstacle to /3-generahzation at this level is the presence of F Gaussians in 
the intermediate diagonals. 

Proceeding with an analogue of the RRV convergence result hinges on reinterpreting 
these forms as block tridiagonal with r x r blocks. In Section 3 we develop an Mr(F)-valued 
analogue of the RRV technology, providing general conditions under which the principal 
eigenvalues and corresponding eigenvectors of such a random block tridiagonal matrix con- 
verge to a those of a continuum half-line random Schrodinger operator with matrix-valued 
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potential. As in Part I we allow for a general boundary condition at the origin. 

In Section 4 we apply this result to the band forms just described, proving a process 
central limit theorem for the potential and verifying the required tightness assumptions. 
The limiting operator turns out to be a multidimensional version of the stochastic Airy 
operator, which we now describe. 

First, a standard F Brownian motion {bt}t>o is a continuous F-valued random pro- 
cess with bo = and independent increments bf — bg ^ ¥N{0, t — s). (It can be described in 
terms of /3 = 1, 2 or 4 independent standard real Brownian motions.) A standard matrix 
Brownian motion {Bt}t>o has continuous M„(F)-valued paths with Bq = and indepen- 
dent increments Bt — Bs distributed as y/t — s times a GO/U/SE. The diagonal processes 
are thus \/2//3 times standard real Brownian motions while the off-diagonal processes are 
standard F Brownian motions, mutually independent up to symmetry. 

Finally, we define the multivariate stochastic Airy operator. Operating on the 
vector-valued function space L^(]R_|_, F'**) with inner product (/,(?) = Pg and associated 
norm ||/||^ = |/|^, it is the random Schrodinger operator 

n^ = -^ + V2B', + rx (1.1) 

where B'^ is "standard matrix white noise", the derivative of a standard matrix Brownian 
motion, and rx is scalar. (Here again /3 is restricted to the classical values, as the noise 
term lacks a straightforward /3-generalization.) The potential is thus the derivative of a 
continuous matrix-valued function; rigorous definitions will appear in Section 3 in a more 
general setting. 

For now it is enough to know that, together with a general self-adjoint boundary condition 

/'(O) = Wf{0), (1.2) 

the multivariate stochastic Airy operator is bounded below with purely discrete specturm 
given by a variational principle. Here W G Mr(F); actually, writing the spectral decomposi- 
tion W = J2i=i ua^iul, we formally allow Wi G {—oo, oo]. Writing /j = ulf, (1.2) is then to 
be interpreted as 

/;(0) = wji{0) for w,eR 

/j(0) = for Wi = +00. 

We write W e M*(¥) for this extended set and "H/j.iy for (1.1) together with (1.2). 

For concreteness, we record that the eigenvalues Aq < Ai < . . . and corresponding 
eigenfunctions /o, /i, • • • of His^w are given respectively by the minimum and any minimizer 
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in the recursive variational problem 

inf / {\f'f + rx\ff)dx + fiOyWfiO) + j. fdBJ. 

11/11=1, f±fo,...Jk-l 

Here canidates / are only considered if the first integral and boundary term are finite; the 
stochastic integral can then be defined pathwise via integration by parts. The eigenvalues 
and eigenfunctions are thus jointly defined random processes indexed over W. 

Remark 1.1. We note one important property of the eigenvalue processes, namely the path- 
wise monotonicity of in W with respect to the usual matrix partial order. This is 
immediate from the variational characterization and the fact that the objective functional is 
monotone in W. (For the higher eigenvalues it is most apparent from the standard min-max 
formulation of the variational problem.) 

We can now state the main convergence results. As outlined, Sections 2-4 furnish the 
proofs. One last shorthand: when we write that a sequence Wn G Mr{¥) tends to W & 
M*(F), we mean the following. Writing W = J2l=i'^i'^i'^\ "with Wi G (—00,00], one has 
Wn = J2l=i '^n,iUiu\ with Wn,i G M Satisfying Wn,i Wi for each i. In other words, the 
matrices are simultaneously diagonal and the eigenvalues tend to the corresponding limits. 

Theorem 1.2. Let A = Aq + \fnPn where Aq is an n x n GO/U/SE matrix and Pn = 
Pn ffi 0„_r with Pn G Mj.{¥), and let Xi > ■ ■ ■ > Xn be its eigenvalues. If 

n^/^{l- Pn) ^ W e M^iW) as 00 

then, jointly for k = 1,2, . . . in the sense of finite- dimensional distributions, 

n^^^ [Xk — 2^/nj =^ — Afc_i as n —)■ 00 

where Aq < Ai < . . . are the eigenvalues ofHjs^w Convergence holds jointly over {Pn}, W 
satisfying the condition. 

Theorem 1.3. Consider a p-variate real/ complex/ quaternion Wishart matrix with n degrees 
of freedom and spiked covariance S„^p = © Ip^^ > with S„p G Mr{¥), and let Xi > 
■ ■ • > XnAp be its nonzero eigenvalues. Writing nin^p = (n~^/^ if 

^n,p (1 - \AVp(S„,p - 1)) WeM*{¥) as 00 

then, jointly for k = 1,2, . . . in the sense of finite- dimensional distributions, 

2 

-^{Xk - {y/n + ^/p)^) =^ -Afc_i as n 00 
y/np V / 
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where Aq < Ai < . . . are the eigenvalues ofHpyi/- Convergence holds jointly over {S„^p}, W 
satisfying the condition. 

Remark 1.4. In the band basis described above, we also have joint convergence of the cor- 
responding eigenvectors to the eigenfunctions of T-ipy^. In detail, the eigenvectors should be 
embedded in L^(]R+) as step functions with step width n~^/^ in the Gaussian case and m~p 
in the Wishart case, and convergence is in law with respect to the norm topology. To be 
precise, one should use either subsequences or spectral projections; one could also formulate 
the joint eigenvalue-eigenvector convergence in terms of the norm resolvent topology. See 
Theorem 3.9 and the remark that follows. 

We now give the two promised alternative characterizations of the limiting eigenvalue 
laws. Fix (3 = 1, 2, 4 and W G M*(F) with eigenvalues — oo < i^i < . . . < w,. < oo. Writing 
P for the probability measure associated with T^/j^vk and its spectrum {Aq < Ai <...}, let 

wi, . . . , Wr) = P(-Afc < x) 

for k = 0, 1, . . .. Write simply = for the ground state distribution (limiting largest 
eigenvalue law). Once again, the generalization from Part I is not straightforward. The 
proofs are contained in Section 5, 

Theorem 1.5. Let Pxo,{wi,...,wr) be the measure on paths (pi, . . . ,pr) : [xq, oo) — )■ (— oo, oo]^ 
determined by the coupled diffusions 

dpi = -y^dbi + i rx - p"^ + — - — I dx (1.3) 

with initial conditions Pi{xo) = Wi and entering into {pi < ... < pr}, where bi,...,br 
are independent standard Brownian motions; particles Pi may explode to —oo in finite time 
whereupon they are restarted at +oo. Then 

Fp{x]Wi, ...,Wr)= Px/r,iwi,...,wr){^0 CXplosioUs). (1.4) 

More generally, 

Fp{x]Wi, . . . ,Wr) = Px/r,{wi,...,wr)i^'^ most k cxplosious). (1.5) 

We describe the diffusion more carefully in Section 5, asserting that it determines a 
law on paths valued in an appropriate space. Probabilistic arguments lead to the following 
reformulation in terms of its generator. 
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Theorem 1.6. -F^(x; Wi, . . . , Wr) is the unique bounded function F : M x — )■ M symmetric 
with respect to permutation of Wi, . . . ,Wr that satisfies the PDF 



dx 



dF 




+ 




dwi ) Wi — Wj \ dwi 




dwj 



dF 







(1.6) 



and the boundary conditions 



F ^0 



F 1 



as a; — 7- oo with wi, . . . ,Wr bounded below; 
as any Wi — )■ — oo with x bounded above. 



(1.7) 
(1.8) 



Furthermore, Fp is "continuous to the boundary" as one or several Wi +oo. For subsequent 
eigenvalue laws Fa{x]Wi, . . . , Wr), (1-8) is replaced with the recursive boundary condition 



At /3 = 2 these distributions were obtained in BBP in the form of Fredholm determinants 
of finite-rank perturbations of the Airy kernel. Baik (2006) derived Painleve II formulas, and 
by a symbolic computation with the computer algebra system Maple we were able to verify 
that the latter satisfy the PDE (1.6) for r = 2,3,4,5. We give the details in Section 6, 
Of course a pencil-and-paper proof for all r would be more illuminating and certainly more 
satisfying. 

We make two final remarks. From the finite n matrix models it is clear that the "rank r 
deformed" limiting distributions Fjs^r{x; Wi, . . . , Wr) reduce to those for a lower rank < r 
in the following way: 



Unfortunately this reduction relation is not readily apparent from any of our characteriza- 
tions (operator, SDK or PDE). 

Lastly, the SDK and PDE characterizations seem to make sense for all /3 > (although 
one has to be careful for f3 < 1). It would be interesting to find natural "general (3 multispiked 
models" at finite n, interpolating between those studied here at /3 = 1,2,4 and generalizing 
those introduced in Part I for r = 1. At /3 = 2, perhaps one could discover a relationship 
with formulas of Baik and Wang (2011). 



F^{x;wi,...,Wr) F^ ^(x*;Wi,...,w*_i,-hoo) 
as a: — )• X* G M, — )■ w* G M for i = 1, . . . , r — 1, and Wr — ?■ — oo. 



(1.9) 



Fp^r{x;Wi, . . . ,Wro,+OC, . . . ,+00) = Fp^ro{x]Wi 



,...,Wro). 
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2 A canonical form for perturbation in a fixed subspace 



In Part I we observed that the tridiagonal models of Gaussian and Wishart matrices were 
amenable to rank one perturbation. In this section we introduce a banded (also block 
tridiagonal) generalization amenable to higher-rank perturbation. We first describe it as 
a natural object of pure linear algebra; we then show how it interacts with the structure 
of Gaussian and Wishart random matrices to produce the band forms displayed in the 
introduction. 

The basic facts of "linear algebra over F", where F may be M, C or the skew field of 
quaternions H, are summarized in Appendix E of Anderson, Guionnet and Zeitouni (2009). 
Everything we need (inner product geometry, self-adjointness, eigenvalues, and the spectral 
theorem) simply works over EI as expected, keeping in mind only that non-real scalars may 
not commute. 

The band Jacobi form as an algebraic object 

We present a natural "canonical form" for studying perturbations in a fixed subspace of 
dimension r. It is a (2r -|- l)-diagonal band matrix generalizing the symmetric tridiagonal 
Jacobi form, which is the r = 1 case. The outermost diagonals continue to be positive; 
however, intermediate diagonals between the main and outermost ones are not in general 
real. Once again, the presence of F Gaussians is the obstacle to writing down a general /3 
analogue. 

We begin with a geometric, coordinate-free formulation. 

Theorem 2.1. Let T be a self-adjoint linear transformation on a finite- dimensional inner 
product space V of dimension n over F. An orthonormal sequence {wi, . . . C V with 
1 < r < n can he extended to an ordered orthonormal basis {vi, . . . ,f„,} for V such that 
{vi,Tvj) > for \i — j\ = r and {vi,Tvj) = for \i — j\ > r. Furthermore, if {vi,Tvj) > 
for \i — j\ =r then the extension is unique. 

The point is that the same extension works for T' = T + P provided P G M„ (IF) satisfies 
P|^^^ t, }-L ~ ^^^^ ^^^^ span{t>i, . . . ,Vr} is also an invariant subspace of P and we 

speak of perturbing in this subspace. 

Proof. We give an explicit inductive construction. Along the way we will see that the 
uniqueness condition holds precisely when the choice is forced at each step. 

It is convenient to restate the properties of the orthonormal basis in the theorem in 
the following equivalent way: For r -\- 1 < i < n we have {vi,Tvi-r) > and Tvi^r ^ 
span{t>i, . . . , Vi}. Suppose inductively that t>i, . . . , Vk-i have been obtained for some r + 1 < 
k < n, satisfying the preceding conditions forr-|-l<i</c — 1. Let w = Tvk-r] we 
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must choose Vk so that {v^.w) > and w G span{t>i, . . . , f^}. There are two cases to 
consider. If w ^ spanjwi, . . . , Vk-i} then Vk must be a multiple of w' = w — Yli=i {'^iy'^) "^jj 
the positivity condition further forces Vk = w'/ \w'\, which gives {vk,w) = \w'\ > 0. If w G 
span{vi, . . . , Vk-i} then any Vk G {fi, . . . , ffc-i}"*" will do, and in this case {vk, w) = 0. □ 

Remark 2.2. When uniqueness holds, as is generically the case, the basis may also be obtained 
by applying the Gram-Schmidt process to the first n vectors of the sequence 



We now state and prove a concrete matrix formulation in which the first r coordinate 
vectors play the role of Vi,...Vr- The point of the second proof is that it emphasizes the 
resulting band matrix rather than the change of basis; the algorithm will be used in the next 
subsection. 

Theorem 2.3. Let A G Mn{¥) and 1 < r < n. There there exists U G Un{^) of the form 
U = Ir®U with U G f/n-r(IF) such that B = UAW satisfies 



Furthermore, if strict positivity holds in (2.1) then U and B as such are unique. 

We refer to B as the band Jacobi form of A. The allowed perturbations here have 
the form P = P Q) On-r for P G Mr(¥); these are invariant under conjugation by U, so 



Proof. We prove existence by giving an explicit algorithm; it generalizes the Lanczos algo- 
rithm, which applies in the case r = 1. 

• For the first step, let v = [Ai^i]r+i<i<n G F"~^ and take U G t/„_r.(F) such that 
Uv = \v\ei, where ei is the first standard basis vector of F""''. A concrete choice is the 
Householder reflection U = In-r — ^wwyw'^w with w = v — \v\ei. Set Ui = Ir®U and 
Bi = UiAUl 

• Continue inductively: Having obtained Uk,Bk, let v = [(-Bi)i,(fe+i)]r+fe+i<i<n ^ 

and take If G Un-r-k{^) such that Uv = \v\ei. Set f/^+i = Ir+k © U and Bk+i = 
Uk+iBkUl^^. 

• Stop when k = n — r. Let U = Un~r ■ ■ - Ui and B = B^-r = U AU'^ . 



fl, . . .,Vr,Tvi, . . .,TVr,T'^Vi, . . . ,T'^Vr, .... 



Bij > for 1 < i, j < n with \i — j\ = t, 
Bij = for 1 < i, j < n with \i — j\ > r. 



(2.1) 
(2.2) 



U{A + P)W = B + P. 
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It is immediate that U and B have the required properties. The point is that the fcth column 
of Bk aheady "looks right", i.e. {Bk)r+k,k > and {Bk)r+i,k = for I > k, and subsequent 
transformations Uk+i, ■ ■ ■ , Un-k G {Ir+k} ® Un-r-ki^) "don't mess it up." 

Towards uniqueness, suppose that U',B' = U' AU''^ also have the required properties 
and let W = U'U~^ so that B' = WBW'^. Assume inductively that W e {Ir+k} © 
Un-r-k{^), which is certainly true in the base case k = 0. Write W = Ir+k © W. Let 
b = [-B,;,fc+i]r+fc+i<?;<n ^ JT"-'"-':: similarly for b'. Then b' = Wb. But b = aci and 
V = a'ei with a, a' > by assumption. It follows that a = a' and Wei = ci. Hence 
W G {/i} © f/n-r-(fc+i)(IF) and W E {Ir+k+i} © f4-r-(fc+i)(IF), completing the induction 
step. We conclude that W = In- □ 

Perturbed Gaussian and spiked Wishart models 

The change of basis described above interacts very nicely with the Gaussian structure in 
Gaussian and Wishart random matrices. The r = 1 case of this observation is due to Trotter 
(1984), who described the tridiagonal forms explicitly. His forms fall into the framework of 
Theorem 2.1 by taking the initial vector to be fixed in the Gaussian case, and taking it to be 
the top row of the data matrix in the Wishart case. As we observed in Part I, the change of 
basis commutes with rank one additive perturbations for the Gaussian case and with rank 
one spiking for the Wishart case. We now extend the story to the r > 1 setting. 

In the Gaussian case we will be perturbing in a fixed (non-random) subspace; without 
loss of generality this may be taken as the initial r-dimensional coordinate subspace, and so 
we take the basis of Theorem 2.1 that begins with the first r standard basis vectors. We 
can therefore obtain the band form by a direct application of the algorithm from the proof 
of Theorem 2.3. The Wishart case is a little more complicated; here we want to perturb in 
the random subspace spanned by the first r rows of the data matrix. Our new basis will 
begin with the Gram-Schmidt orthogonalization of these initial rows. As in the r = 1 case 
it is most transparent to construct a lower band form of the data matrix first, afterwards 
realizing the band Jacobi form as its multiplicative symmetrization. In both the Gaussian 
and the Wishart cases we will see that the uniqueness condition of Theorem 2.1 holds almost 
surely. 

Let A be an n X n GOE matrix. Applying the algorithm from the proof of Theorem 2.3 
while keeping track of the distribution of the matrix B^. at each step — the key of course 
being the unitary invariance of standard Gaussian vectors — yields the following band Jacobi 
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random matrix G = UAW: 




I = 



J 




< 



9ij j <i<j + r 

7^X(n-i+i)/3 i=j + r 



(2.3) 



i > j + r 



V 




t < J 



for 1 < i,j < n, where the random variables appearing exphcitly are independent, gi ~ 
A^(0, 1), Qij ~ FA^(0, 1), and Xk ~ Chi(fc). The latter is the distribution of the length of a 
/c-dimensional standard Gaussian vector. 

We can introduce a rank r additive perturbation A = Aq + \fnP ^ where P = P (B On-r 
with P G Mr(¥); since P commutes with the change of basis U G {Ir} © f/„_r(IF), we can 
write 



As expected the perturbation shows up undisturbed in the upper-left r x r corner of G 

Turning to the Wishart case, we first consider the null Wishart random matrix X'^X, 
where X is p x n with independent FA^(0, 1) entries. (Remember that X'^X and XX"^ have 
the same nonzero eigenvalues Ai, . . . ,XnAp-) The final form can be described abstractly as 
given in the basis of Theorem 2.1 that extends the Gram-Schmidt orthogonalization of the 
first r rows of X. One cannot readily obtain a description of the resulting random matrix 
from here, however, so we give another way that generalizes Trotter's original procedure. It 
is a "singular value analogue" of the algorithm from the proof of Theorem 2.3, producing 
matrices U G Un{^) and V G Up{¥) such that L = VXU has a "lower band form" that is 
zero off the main and first r subdiagonals and positive on the outermost of these. The key 
is to work alternately on rows and columns. 

• Take t/i G Un(¥) so that the first row of XUi lies in the (positive) direction of the first 
coordinate basis vector of F". 

• Take Vi = Ir Q) Up-r(¥) so that [{ViXUi)i^i]r+i<i<p G F^"*" lies in the direction of the 
first coordinate basis vector of the latter subspace. 

• Take f/2 G Ji © f/„-i(F) so that [{ViXUiU2)2,j]2<j<n e F""^ lies in the direction of the 
first coordinate basis vector of the latter subspace. 



• Take V2 G Ir+i © t/p„^_i(F) so that [iV2ViXUiU2)i,2]r+2<j<p e Fp^'-^ lies in the direc- 
tion of the first coordinate basis vector of the latter subspace. 



G = UAU^ = U{Ao + VnP)U^ = UAqU^ + V^P = Go + V^P. 



(2.4) 
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• Continue in this way until the rows and columns both run out (stop alternating if one 
runs out before the other). 

The resulting L = VnA{p-r) ■ ■ ■ ViXUi ■ ■ ■ f/„Ap has n A p nonzero columns and {n + r) A p 
nonzero rows, which can be described as follows: 



9ij j <i<j + r 

7^X(p-i+i)/3 i=J+r 

i<joTi>j + r 



(2.5) 



where the entries are independent, Xk, Xk ~ Chi(A;), gij ~ FA^(0, 1). Truncating the remain- 
ing zero rows or columns, the matrix S = L'^L is {n Ap) x {n Ap) and has the same nonzero 
eigenvalues as X^X. It has the band form 



S.. 



/3^(n-j+l)/3 + X]j<fc<j+r \9k,i\ + ^X(p_i_r+1)^ 



I = J 

-^X(n-i+l)p9ij + Y.i<k<j+r 9*k,i9kJ + -^9*j+r,iX{p-j-r+l)fi j <i <j +r 
lXin-i+l)l3Xip-i+l)l3 i = j + r 

i > j + r 

i < j 



(2.6) 



where we have ignored the issue of truncation in the final r rows and columns {g's and 
x's with indices beyond the allowed range should simply be zero). The change of basis is 
thus Ui - ■ ■ UnAp] a little thought shows that, as claimed earlier, the new basis begins with 
the orthogonalization of the first r rows of X. Since the form (2.6) satisfies the uniqueness 
condition of Theorem 2.1 a.s., the basis is indeed the one given by the theorem. 

Now we consider the spiked Wishart matrix X'^X = XqSXq, with S = flQlp-r > 0. Here 
Xo is a null Wishart matrix and X = J^^^^Xq. Notice that X^X-X^Xq = X^ ((S-I,)©0)Xo 
is indeed an additive perturbation in the subspace spanned by the first r rows of Xq. Since 
S"*^/^ = S"*^/^ © / commutes with the inner transformation V E {Ir} © Up^r(^), we have 

L^L = u^x^xu = u^xli:XoU = u^xlv^i:vXoU = lJsLo 

where L = VXU and Lq = VXqU. The point is that same change of basis works in the 
rank r spiked case, and by the lower band structure of Lq, the perturbation shows up in the 
upper-left r x r corner: 



S — 5*0 — L — lI^Lq 



Ir)Lo © 0. 



(2.7) 
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Viewed in terms of the algorithm used to produce L, the point is that the first r rows of X 
are never "mixed" together or with the lower rows, but only "rotated" within themselves. 

3 Limits of block tridiagonal matrices 

The banded forms of Section 2 may also be considered as block tridiagonal matrices with 
r X r blocks. In this section we give general conditions under which such random matrices, 
appropriately scaled, converge at the soft spectral edge to a random Schrodinger operator on 
the half-line with r x r matrix-valued potential and general self-adjoint boundary condition 
at the origin. In Section 4 we verify these assumptions for the two specific matrix models 
we consider. 

Proposition 3.7 establishes that the limiting operator is a.s. bounded below with purely 
discrete spectrum via a variational principle. The main result is Theorem 3.9, which asserts 
that the low- lying states of the discrete models converge to those of the operator limit. 

The scalar r = 1 case of Part I, based in turn on RRV, serves as a prototype. Care 
is required throughout to adapt the arguments to the matrix-valued setting, and we give a 
self-contained treatment. 

Discrete model and embedding 

Underlying the convergence is the embedding of the discrete half-line = {0, 1, . . .} into the 
continuum M.^ = [0, oo) via j t-)- j/nin, where the scale factors m„ — )■ oo but with m„ = o{n). 
Define an associated embedding of vector-valued function spaces by step functions: 

£^(Z+,F'-) ->L2(M+,F'^), (vo^vu...) ^ vix)=vi^„,i, 

which is isometric with i"^ norm ||f ||^ = m^^ Xljlo I'^il^- (R-^call that and have norms 
\vf = v'^v and ||/||^ = \ff respectively.) Fix a standard basis for £^ with lexicographic 
ordering 

(ei, 0, . . .), (ea, 0, ...),... , (e^, 0, . . .), (0, ei, 0, ...),.. . 

where 6]^^ • • • IS the standard basis for F*". Identify F" with the n- dimensional initial 
coordinate subspace of if^, consisting of F^-valued step-functions supported on the interval 
[O, [n/r] /m„) and with the final step value in the subspace spanned by ei, . . . , er-{\n/r]r-n)- 
Our n X n matrices will act on F" with respect to the above basis; we will generally assume 
the embedding F" C £^ implicitly. 

We define some operators on L^, all of which leave i"^ invariant and may also be considered 
as infinite block matrices with r x r blocks. The translation operator (T„/)(a;) = f{x + m~^) 
extends the left shift on i"^. Its adjoint is the right shift, where T^/ = on [0, m~^). The 
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difference quotient D„ = m„(T„ — 1) extends a discrete derivative. Write diag(Ao, Ai, . . .) for 
both an r X r block diagonal matrix and its extension to a pointwise matrix multiplication on 
L^. Thus En = diag(m„J,., 0, 0, . . .) is scalar multiplication by m„ljQ a "discretized delta 
function at the origin". Orthogonal projection from i"^ onto F*^ extends to a multiplication 
Rn = diag(/r., . ■ . ,Ir, diag(l, . . . , 1, 0, . . . , 0), 0, . . .), in which there are \n/r] non-zero blocks 
and a total of n I's. 

Let i = 1,2 he two discrete-time r x r matrix- valued random processes with 

^n,i;j € Mr{¥) ioT all j. The processes may be embedded into continuous time as above, 
by setting = Yn,i-irn„x\- Note also that T„ and A„ = m„(l - T^) = -D^ may be 

sensibly applied to such matrix-valued functions. The processes Yn^i are on- and off-diagonal 
integrated potentials, and we define a "potential operator" by 

K = diag(A„y„,i) + i(diag(A„r„,2)T„ + Tidiag(A„yJ_2)). (3.1) 

Fix Wn G Mr{¥), a nonrandom "boundary term". 
Finally, consider 

Hn = Rn{DiDn + Vn + WnEn)Rn. (3.2) 

This operator leaves the initial coordinate subspace F" invariant; we shall also use iJ„ to 
denote the matrix of its restriction to F". The matrix if„ G M„(F) is self-adjoint and 
block tridiagonal up to a truncation in the lower-right corner. Its main- and super-diagonal 
processes are 

m^ + m„(iy„ + F„,i;o), 2m^ + m„(F„,i;i - y„,i;o), 2m^ + m„(y„,i.2 - y„,i;i), ... 

-ml + ^m„K„,2;0, + \mn{Yn,2;l - Yn,2-fl), ■ ■ ■ (3.3) 

respectively; the sub-diagonal process is of course the conjugate transpose of the super- 
diagonal process. (We could have absorbed Wn into as an additive constant, but keep 
it separate for reasons that will soon be clear. Note also that the upper-left block has ml 
rather than 2ml.) refer to Hn as a rank r block tridiagonal ensemble. 

As in RRV and Part I, convergence rests on a few key assumptions on the potential and 
boundary terms just introduced. By choice, no additional scaling will be required. The role 
of the convergence in the first and third assumption below will be clear as soon as we define 
the continuum limit. The growth and oscillation bounds of the second assumption (and the 
lower bound imphed by the third) ensure tightness of the low-lying states; in particular, they 
guarantee that the spectrum remains discrete and bounded below in the limit. 
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Assumption 1 (Tightness and convergence) . There exists a continuous Mr(F)-valued ran- 
dom process {Y{x)}x>o witli Y{0) = such that 

{Yn,i{x)}a:>o, i = 1,2 are tight in law, 
Yn,i + UYn,2 + Yl,) Y inlaw 

with respect to the compact-uniform topology (defined using any matrix norm). 

Assumption 2 (Growth and oscillation bounds). There is a decomposition 

j 

A:=0 

(so AnYn^i = rin,i + A„a;„^j) with rjn,r,j > (as matrices), such that for some deterministic 
scalar continuous nondecreasing unbounded functions f]{x) > 0, ({x) > 1 not depending on 
n, and random constants > 1 defined on the same probability spaces, the following hold: 
The tight in distribution, and for each n we have almost surely 

f]{x)/Kn-Kn < r]n,lix) + r]n,2{x) < Kn{l+f]{x)), (3.6) 

Vn,2ix) < 2ml (3.7) 

\uJn,l{0 - '^n,l{x)f + \(^n,2{0 - ^n,2{x)f < K„ (l + ?7(a;)/C(x)) (3.8) 

for all x,^ G [0, [n/r]/m„) with |^ — a;| < 1. Here matrix inequalities have their usual 
meaning and single bars denote the spectral (or £^(F^) operator) norm. 

Assumption 3 ( Critical or suhcritical perturbation). For some orthonormal basis ui, . . . ,Ur 
of F'' and — oo < Wi < . . . < < oo we have Wn = Y7i=i '^n,iUiu\, where Wn,i € M satisfy 
lim^^oo Wn,i = Wi for each i. 

We write ro = : Wi < oo} G {0, . . . ,r} for the "critical rank". Formally, Wn W = 
J2l=i '^i'^i'^l ^ M*{¥). It is natural to view as a parameter: that is, we will consider the 
joint behaviour of the model (for given Yn^i, Y) over all Wn, W satisfying Assumption 3. 

Reduction to deterministic setting 

In the next subsection we will define a limiting object in terms of Y{x) and W; we want 
to prove that the discrete models converge to this continuum limit in law. We reduce the 
problem to a deterministic convergence statement as follows. First, select any subsequence. It 
will be convenient to extract a further subsequence so that certain additional tight sequences 
converge jointly in law; Skorokhod's representation theorem (see Ethier and Kurtz 1986) says 
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this convergence can be realized almost surely on a single probability space. We may then 
proceed pathwise. 

In detail, consider (3.4)-(3.8). Note in particular that non-negativity of the rjn^i and the 
upper bound of (3.6) give that for i = 1,2 the piecewise linear process {Jo Vn,i} is tight 
in distribution, pointwise with respect to the spectral norm and in fact compact-uniformly. 
Given a subsequence, we pass to a further subsequence so that the following distributional 
limits exist jointly: 



Y ^ Y 



JoVn,i Vi, (3-9) 

for i = 1,2, where convergence in the first two lines is in the compact-uniform topology. 
We realize (3.9) pathwise a.s. on some probability space and continue in this deterministic 
setting. 

We can take (3.6)-(3.8) to hold with Kn replaced with a single k. Observe that (3.6) gives 
a local Lipschitz bound on the J rjn^i, which is inherited by their limits rji (the spectral norm 
controls the matrix entries). Thus rji = rji is defined almost everywhere on IR+, satisfies (3.6), 
and may be defined to satisfy this inequality everywhere. Furthermore, one easily checks 
that ^n^^Vn,i ~^ J Vi compact-uuiformly as well (use continuity of the limit). Therefore 
<^n,i = yn,i — ^n^J2Vn,i must havc a continuous limit Ui for i = 1,2; moreover, the bound 
(3.8) is inherited by the limits. Lastly, put rj = rji + ri2, u = Ui + ^{uj2 + ^l) note that 
Yi = J 7]i + Ui and Y = J rj + u. For convenience, we record the bounds inherited by rj, u: 

fj{x)/K — K < r]{x) < K(l + rj(x)) (3.10) 

HO - < <^ + vix)/ax)) (3.11) 

for X, G IR+ with |^ — a;| < 1 (and note that k > 1). 

We will assume this subsequential pathwise coupling for the remainder of the section. 

Limiting object and variational characterization 

Formally, the limiting object is the eigenvalue problem 

'Hf = Af onL\R^,r) 
f{0) = Wf{0) 

where 
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Writing the spectral decomposition W = Y7i=i ^j^j^I) recall (Assumption 3) that we actually 
allow G M for 1 < i < and, symbolically, Wi = +00 for r^ + l < i < r. Writing fi = u\f , 
the boundary condition is then to be interpreted as 



We thus have a completely general homogeneous linear self-adjoint boundary condition. We 
refer to spanj^j : 2 > tq} as the Dirichlet subspace and the corresponding /, as Dirichlet 
components; they will require special treatment in what follows. 

We will actually work with a symmetric bilinear form (properly, sesquilinear if F = C 
or H) associated with the eigenvalue problem (3.12). Define a space of test functions 
consisting of smooth F'^- valued functions if on IR^ with compact support; we additionally re- 
quire the Dirichlet components to be supported away from the origin. Introduce a symmetric 
bilinear form on x by 



where the Dirichlet part of the last term is interpreted as zero. Formally, the form 'H(-, ■) 
is just the usual one {-.l-i-) associated with the operator "H; the potential term has been 
integrated by parts and the boundary condition "built in". See also Remark 3.5 below. 

The regularity and decay conditions naturally associated with this form are given by the 
following weighted Sobolev norm: 



where the positive part of W is defined as = X]i=i ^t^i"^]. with w'^ = w V 0. (Define 
the negative part similarly with = —{wAO), so that W = — W~ .) We refer to 
||-||^ as the L* norm and define an associated Hilbert space L* as the closure of under 
this norm. (The formal Dirichlet terms are again interpreted to be zero, but they can also 
be thought of as imposing the Dirichlet condition.) We record some basic facts about L* . 

Fact 3.1. Any f E L* is uniformly Holder (l/2)-continuous and satisfies |/(x)|^ < 2 ||/'|| ||/|| 
< 11/11! for all x; furthermore, /i(0) = for i > tq. 



Proof We have \f{y)-f{x)\ = |/Jf | < ||/'|| \y - x\^^\ For / e we have \ f{x)f = 
— 2 Re /'''/' < 2 11/11 1 1 /'I I < 11/11^; an L*-bounded sequence in therefore has a compact- 



uniformly convergent subsequence, so we can extend this bound to / G L* and also conclude 



/;(0) = Wif.iO) for t < ro, 
/i(0) = for z > ro. 



(3.13) 



niv,^) = {v',^')-{^',Ytlj)-{^,Ytlj') + ^{Oywm 



(3.14) 




(3.15) 



the behaviour in the Dirichlet components. 



□ 
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Fact 3.2. Every L* - bounded sequence has a subsequence converging in the following modes: 
(i) weakly in L* , (ii) derivatives weakly in L^, (Hi) uniformly on compacts, and (iv) in . 

Proof, (i) and (ii) are just Banach-Alaoglu; (iii) is the previous fact and Arzela-Ascoli again; 
(iii) implies convergence locally, while the uniform bound on J fi\fn\^ produces the uniform 
integrability required for (iv). Note that the weak limit in (ii) really is the derivative of the 
limit function, as one can see by integrating against functions l[o,x] and using pointwise 
convergence. □ 

By the bound in Fact 3.1 with a; = 0, the boundary term in (3.15) could be done away 
with. It is natural to include the term, however, when considering all W simultaneously and 
viewing the Dirichlet limiting case. More importantly, it clarifies the role of the 

boundary terms in the following key bound. 

Lemma 3.3. For every < c < 1/k there is a C > such that, for each b > 0, the following 
holds for all W > -b and all f e : 

c\\f\\l-{l + b')C\\ff < H{fJ) < CWfWl (3.16) 

In particular, 7{(-, ■) extends uniquely to a continuous symmetric bilinear form on L* x L* . 

Proof. For the first three terms of (3.14), we use the decomposition Y = J rj + u from the 
previous subsection. Integrating the J rj term by parts, (3.10) easily yields 

^ll/ll!-MI/f < \\ft + {f,vf) <^\\f\\l 

Break up the u term as follows. The moving average uix = J^~^^ ^ is differentiable with 
uj'^ = oJx+i — oJx] writing u = u + {u — co), we have 

-2 Re(/', uf) = (/, Lj'f) + 2 Re(/', {u - u)f). 

By (3.11), max(|(X'^ — u^l , \u)^ — u)xf) < C'e + ef]{x) for |^ — x| < 1, where e can be made 
small. In particular, the first term above is bounded absolutely by e ||/||^ + ||/||^. Av- 
eraging, we also get — uJx\ < (C^ + ef]{x)Y^'^; Cauchy-Schwarz then bounds the second 
term absolutely by ^i/J" |/f + ^ J^°^ (C, + ef]) \f\' and thus by ^ \\f\\l + ||/f . Now 
combine all the terms and set e small to obtain a version of (3.16) with the boundary terms 
omitted (from both the form and the norm). 

We break the boundary term in (3.14) into its positive and negative parts. For the 
negative part. Fact 3.2 gives |/(0)|^ < (e/6) ||/'f + (b/e) ||/f ; W' <b then implies that 

< /(o)tiy-/(o) < 5 11/11^ + 0^11/11% 
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which may be subtracted from the inequahty already obtained. For the positive part 
f{OyW~^f{0), use the fact that c < 1 < C to simply add it in. We thus arrive at (3.16). 

For the L* bilinear form bound, begin with the quadratic form bound \'H{f,f)\ < 
^c,b 11/11*; it is a standard Hilbert space fact that it may be polarized to a bilinear form 
bound (see e.g. §18 of Halmos 1957). □ 

Definition 3.4. We say / G L* is an eigenfunction with eigenvalue A if / 7^ and for 

all ip G we have 

H{^J) = A{^J). (3.17) 
Note that (3.17) then automatically holds for all ip G L*, by L*-continuity of both sides. 

Remark 3.5. This definition represents a weak or distributional version of the problem (3.12). 
As further justification, integrate by parts to write the definition 

{^', f) - (^', Yf) - (^, Yf) + m^WfiQ) = A (y., /) 

in the form 

{V\ f) - {V\ Yf) + (^', Yf) - (^', Wm) = -A (^', /> , 
which is equivalent to 

fix) = Wf{0) + F(x)/(x) - /V/' - A n a.e. x. (3.18) 

JO Jo 

(For a Dirichlet component fi the restriction on test functions implies that {(f'^, 1) = 0, 
so the first boundary term on the right-hand side is replaced with an arbitrary constant.) 
Now (3.18) shows that /' has a continuous version, and the equation may be taken to hold 
everywhere. In particular, / satisfies the boundary condition of (3.12) classically. (For 
a Dirichlet component we just find that the arbitrary constant is /j'(0).) One can also 
view (3.18) as a straightforward integrated version of the eigenvalue equation in which the 
potential term has been interpreted via integration by parts. This equation will be useful in 
Lemma 3.6 below and is the starting point for the development in Section 5, 

We now characterize the eigenvalues and eigenf unctions variationally. As usual, it follows 
from the symmetry of the form that eigenvalues are real (and eigenfunctions with distinct 
eigenvalues are L^-orthogonal). The part of the lower bound in (3.16) says the spectrum 
is bounded below. The rest of (3.16) implies that there are only finitely many eigenvalues 
below any given level: a sequence of normalized eigenfunctions with bounded eigenvalues 
must have an L^-convergent subsequence by Fact 3.2, At a given level, more is true: 

Lemma 3.6. For each A G M, the corresponding eigenspace is at most r- dimensional. 
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Proof. By linearity, it suffices to show a solution of (3.18) with /'(O) = /(O) = must vanish 
identically. Integrate by parts to write 

fix) = Yix) fr - - Ax ff + A rtf'it)dt, 

Jo Jo Jo Jo 

which implies that |/'(a;)| < C{x) \ f'\ with some C{x) < oo increasing in x. Gronwall's 
lemma then gives = for all x > 0. □ 

Proposition 3.7. There is a well-defined {k + l)st lowest eigenvalue Ak, counting with 
multiplicity. The eigenvalues Aq < Ai < ... together with an orthonormal sequence of 
corresponding eigenvectors /o,/i, • • • are given recursively by the variational problem 

/-L/ov/fe-i 

in which the minimum is attained and we set fk to be any minimizer. 

Remark 3.8. Since we must have A^ — t- oo, {Ao,Ai,...} exhausts the spectrum and the 
resolvent operator is compact. We do not make this statement precise. 

Proof. First taking k = 0, the infimum A is finite by (3.16). Let be a minimizing sequence; 
it is L*-bounded, again by (3.16). Pass to a subsequence converging to / G L* in all the 
modes of Fact 3.2, In particular 1 = — ||/||, so > A by definition. But also 

nf, f) = \\ff + if, Vf) + (/, oJ'f) + 2 Re(/', {uj - oo)f) + fiO^WfiO) 
< liminf H(/„,/„) 

n— >oo 

by a term-by-term comparison. Indeed, the inequality holds for the ffist term by weak 
convergence, and for the second term by pointwise convergence and Fatou's lemma; the 
remaining terms are just equal to the corresponding limits, because the second members of 
the inner products converge in by the bounds from the proof of Lemma 3.3 together with 
L*-boundedness and L^-convergence. Therefore /) = A. 

A standard argument now shows (A, /) is an eigenvalue-eigenfunction pair: taking ip G 
and e small, put = {f + eip)/\\f + e(p\\; since / is a minimizer, ^\^^Q'H{f^ , f^) must 
vanish; the latter says precisely (3.17) with A. Finally, suppose {A,g) is any eigenvalue- 
eigenfunction pair; then 'H{g,g) = A, and hence A < A. We are thus justified in setting 
Ao = A and /o = /. 

Proceed inductively, minimizing now over the orthocomplement {/ G L* : ||/|| = 1,/ ± 
/o, . . . fk-i}- Again, L^-convergence of a minimizing sequence guarantees that the limit 
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remains admissible; as before, the limit is in fact a minimizer; conclude by applying the 
arguments of the previous paragraph with y), g also restricted to the orthocomplement. □ 

Statement 

We are finally ready to state the main result of this section. Recall that we consider eigen- 
vectors of a matrix Hn G M„(F) in the embedding F" C ^2(z+,F^) ^ L2(M+,F'^) above. 

Theorem 3.9. Let Hn he a rank r block tridiagonal ensemble as in (3.2) satisfying As- 
sumptions 1-3, and let Xn.k be its {k + l)st lowest eigenvalue. Define the associated form 7i 
as in (3.14) and let be its a.s. defined {k + l)st lowest eigenvalue. In the deterministic 
setting of subsequential pathwise coupling, Xn,k for each k = 0,1, . . .. Furthermore, a 

sequence of normalized eigenvectors corresponding to Xn,k is pre-compact in norm, and 
every subsequential limit is an eigenfunction corresponding to A^. Finally, convergence holds 
uniformly over possible Wn,W > —b > — oo. One recovers the corresponding distributional 
tightness and convergence statements for the full sequence, jointly for k = 0,1, ... in the sense 
of finite- dimensional distributions and jointly over Wn,W. 

Remark 3.10. The eigenvector convergence statement requires subsequences for two reasons: 
possible multiplicity of the limiting eigenvalues, and the sign or phase ambiguity of the 
eigenvectors. It is possible to formulate the conclusion of the theorem very simply using 
spectral projections. (If H has purely discrete spectrum, the spectral projection 1a{H) is 
simply orthogonal projection of onto the span of those eigenvectors of H whose eigen- 
values lie in y4 C M.) The joint eigenvalue-eigenvector convergence may be restated in 
the deterministic setting as follows: For all a & M.\ {Aq, Ai, . . .}, the spectral projections 
l(-oo,a)(-f^n) — > l(-oo,a)('H) in opcrator norm. The corresponding distributional state- 
ment holds jointly over all a that are a.s. off the limiting spectrum (or simply all a if the 
distributions of the A^ are non-atomic). 

Remark 3.11. An operator-theoretic formulation of the theorem (which we do not develop 
here) would state a norm resolvent convergence: the resolvent matrices, precomposed with 
the finite-rank projections — > F" associated with the embedding, converge to the contin- 
uum resolvent in operator norm. This mode of convergence is the strongest one can hope 
for in the unbounded setting (see e.g. VIII. 7 of Reed and Simon 1980, Weidmann 1997). 

The proof will be given over the course of the next two subsections. 
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Tightness 

We will need a discrete analogue of the L* norm and a counterpart of Lemma 3.3 with 
constants uniform in n. For w G F" L^(]R+,F^) as above, define the L* norm by 



oo 



[DiDn + 1 + + EnW^) v) 

1 2^ + (3-19) 

{\Dnvf + (1 + rj) \vf) + viSifW^vi^)) 

with the non-negative part defined as before. 

Remark 3.12. When considering just a single Wn,W, the boundary term in (3.19) is really 
only required when the limit includes Dirichlet terms; it is simpler, however, not to dis- 
tinguish the two cases here. More importantly, including this term clarifies the role of the 
boundary term in the following key bound. Note that the original case considered in RRV 
has Wn = TTLn in our notation. (The if„ form and L* norm there contained a term m„ |fo|^, 
though it is hidden in the fact that, in our notation, they use A„ in place of Dn-) 

Lemma 3.13. For every < c < 1/4k there is a C > such that, for each b > 0, the 
following holds for all n, Wn > —h and f G F".- 

c\\v\\l^-{l + h^)C\\v\\' < {v,Hnv) < C\\v\\l^. (3.20) 

Proof. We drop the subscript n. The form associated with (3.2) is 

{v, Hv) = \\Dvf + {v, Vv) + v{OyWv{0). (3.21) 

The potential term {v,Vv) = v'^Vv, defined in (3.1), is analyzed according to (3.5): 

v^Vv = v\AYi)v + Rev\AY2)Tv 

= [v^rjiv + Rev^ri2Tv) + [v\Aui)v + Ref^(Aa;2)Tt>). 

Together with |-D„f | , the ?7-terms provide the structure of the bound as we now show. 
Afterwards we will control the w-terms and lastly deal with the boundary term. 

Recall (3.6) and that rji > 0. For an upper bound, rearrange {v — Tv)'^ri2{v — Tv) > to 

Rev'^r]2Tv < \v'^r]2V + \ {Tv)'^r]2Tv 

< \K{f)+l) {\v\^ + \Tv\^) . 

Now J fj \ Tv\^ = J {T'^fj) \vf' < j fi\v^ since fj is nondecreasing, and we obtain 

\\Dvf + {v, riiv) + Re {v, r]2Tv) < 2k \\v\\l . (3.22) 
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Toward a lower bound we use the slightly tricky rearrangement < {^v + Tvyri2{^v + Tv) = 
3Rev^ri2Tv + (Tv — vyri2(Tv — v) — |f^?72f • With (3.7) we get 

Rev''ri2Tv > — |(Tt> — v)'^r]2{Tv — v) + \v'^'r]2V 
> -| \Dvf + \v^r]2V, 

so by (3.6), 

\Dvf + v'^rjiv + Rev^ri2Tv > | \Dvf + k — k) \v\^ 

and thus 

\\Dvf + {v,T]iv) +Re{v,r]2Tv) > (1/4k) \\v\\l - (k/4) \\v\\^ . (3.23) 

We handle the w-terms with a discrete analogue of the decomposition used in the con- 
tinuum proof. Consider the moving average 

LmJ 

i=i 

which has AaJ, = (m/ [m\)(T^"^^ — l)^^; it is convenient to extend UJi{x) = Ui{\n/r~\/mn) for 
X > \n/r]/mn. Decompose Ui =Ui + {ui — cJj). For the wi-term, 

f^AtJif = (m/[mJ)f^(T cJi — u}i)v + w^A(a;i — uJi)v. 

By (3.8) and Cauchy-Schwarz, the first term is bounded absolutely by [C^ + ef]) \vf and its 

1 1 2 1 1 2 m 

integral by £ ||f ||^ + ||f || . The second term calls for a summation by parts: 

(v, A{uji - uji)v) = rUn {{v, (wi - uji)v) - (Tv, {ui - uJi)Tv)) 

= rrin Re(f — Tv, (wi — uJi){v + Tv)') = Re(^Dv, (pi — uJi)(v + Tv)). 

The averaged bound \cJi — uJi\ < (C^ + efj)^/'^ and Cauchy-Schwarz bound the integrand 

I {Dv)\uji - uji){v + Tv)\<^ \Dv\^ + {l/A^){C, + efi){\v\^ + \Tv\^), 

and its integral by ^/e ||f ||^ + lli^H . One thus obtains a similar bound on \ (v, 

There are corresponding bounds for the C(;2-terms. For the a;2-term, use 2|f| \Tv\ < 
\v\^ + iTt;!^. For the {uj2 — cJ2)-term, modify the summation by parts: 

Re{{v, A(w2 - oJ2)Tv) = m^Re [{{v - Tv), {lo2 - uj2)Tv) + {Tv, (cj2 - ^2){Tv - T'^v))) 

= Re{Dv + TDv, {002 - uj2)Tv). 
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Incorporating all the w-terms into (3.22, 3.23) and setting e small, we obtain (3.20) but with 
the boundary terms omitted (from both the form and the norm). 

We break the boundary term in (3.21) into its positive and negative parts. A discrete 
analogue of a bound from Fact 3.1 will be useful: 

poo poo 

\v{0)f = / -D\vf= / Rem{v -TvYiv + Tv) <2\\Dv\\\\v\\ . 
Jo Jo 

It gives 1^^(0)1^ < {e/b) \\Dv\f + {b/e) \\v\f, and then W~ < b implies that 

< v{OyW-v{0) < e\\v\\l + C'^b'^\\v\f 

which may be subtracted from the inequality already obtained. The positive part may simply 
be added in using that c < 1 < C. We thus arrive at (3.20). □ 

Remark 3.14. If the Wn are not bounded below then the lower bound in (3.20) breaks down: 
in fact, the bottom eigenvalue of Hn really goes to — cxd like minus the square of the bottom 
eigenvalue of Wn- This is the supercritical regime. 

Convergence 

We begin with a simple lemma, a discrete-to-continuous version of Fact 3.2, 

Lemma 3.15. Let fn G F" with ||/nL„ uniformly bounded. Then there exist f E L* and 
a subsequence along which (i) fn — > / uniformly on compacts, (ii) fn -^l^ f, and (Hi) 
Dnfn f weakly in . 

Proof. Consider gn{x) = /n(0) + Dnfn, a piecewise-linear version of /„; they coincide at 
points X = i/nin, i € Zf+. One easily checks that Hf^nll^ < 2 so some subsequence 

— )■ / G in all the modes of Fact 3.2; for a Dirichlet component, the boundary term in the 
L* norm guarantees that the limit vanishes at 0. But then also fn^f compact-uniformly 
by a simple argument using the uniform continuity of /, fn -^l^ f because — Qnlf < 
{l/3n^) \\Dnfn\f, and /' weakly in because = g'^ a.e. □ 

Next we establish a kind of weak convergence of the forms to ?/(■,■). Let Vn 

be orthogonal projection from onto F"^ embedded as above. The following facts will be 
useful and are easy to check. For / G L^, Vnf -^l'^ f (the Lebesgue differentiation theorem 
gives pointwise convergence and we have uniform L^-integrability) ; further, if /' G L^ then 
Dnf — >l2 /' {Dnf is a convolution of /' with an approximate delta); for smooth yj, P„(y9 — )■ ip 
uniformly on compacts. It is also useful to note that Vn commutes with i?„ and with DnRn- 
Finally, if /„ -^^2 /, gn is L^-bounded and gn ^ g weakly in L^, then (/„,5'„) -> {f,g). 
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Lemma 3.16. Let fn ^ f be as in the hypothesis and conclusion of Lemma 3.15. Then for 
all if G we have {(f, Hnfn) "^(v^? /)■ -^''^ particular, Vnf f in this way and so 

{VnV, HnVnV) = ^n^n^) ^ V?). (3.24) 

Proof. Since (p is compactly supported we have Rn'-p = f ioi n large and the -R„s may be 
dropped. By assumption -D„/„ is bounded and Dnfn — > /' weakly in L^, so by the 
preceding observations Dn^p — )-l2 ip' and 

For the potential term we must verify that 

Vnfn) = (a„F„,i + |((A„r„,2)T„ + T„t(A„ri^2))) fn) 

converges to — {ip',Yf) — {ip,Yf'). Recall by Assumption 1 (3.4) and (3.9) that — )■ Yi 
compact-uniformly {i = 1, 2) and Y = ^1 + ^(^2 + ^2^)- Writing y„ = y„,i + |(F„,2 + >"„^,2) ^ Y 
(and disregarding the notational collision with Yj), we first approximate Vn by Al^: 

(V9, (A„F„)/„) = m„((v9,F„/„) - {Tn(p,YnTnfn)) 

= mn{{ip,Ynfn) " {Tn^.Ynfn) + {Tnip,Ynfn) - {Tn^^YnT^fn)) 
= - - (T„V2, YnDnfn) , 

which converges to the desired limit by the observations preceding the lemma together with 
the assumptions on /„ and the fact that T„y9 -^12 ip in since m„ ||T„v^ ~ v\\ = ll-^nV^II 
is bounded. The error in the above approximation comes as a sum of T„ and T\ terms. 
Consider twice the T„ term: 

|(<^,(A„y„,2)(T„-l)/n)| = \{vAm-^^nYn,2)Dnfn)\ 

< \\ip\\ sup |F„,2 - TlYn,2\ \\Dnfn\\ 
I 

where J is a compact interval supporting (p. (The single bars in the supremum denote the 
spectral or ^2-operator norm, which is of course equivalent to the max norm on the entries.) 
Note that -D„/„ is L^-bounded because it converges weakly in L^. Now Yn^2 and ^^^,2 both 
converge to Y2 uniformly on /, in the latter case by the uniform continuity of Y2 on /; it 
follows that the supremum, and hence the whole term, vanish in the limit. The term is 
handled similarly, the only difference being that the Dn in the estimate lands on ip instead. 



27 



Finally, for the boundary terms Assumption 3 gives 



where in the Dirichlet case i > tq the left side vanishes for n large because is supported 
away from 0. 

Turning to the second statement, we must verify that Vn^ — ?■ as in Lemma 3.15, 
The uniform L* bound on Vn'^ follows from the following observations: || (P„V^) vTH-^|| = 
||P„(y9v^l + fj|| < + r^ll; for n large enough that Rn(p = (p we have = 

< ||-Dri(/?|| < llv^'ll (Young's inequality); for the boundary term note that (Vn'^)i{0) 
is bounded if i < ro and in fact vanishes for n large if i > tq. The convergence is easy: 
— )■ (f compact-uniformly and in L^, and for g G L"^ we have {g, DnVn<p) = {Vng, Dnf) 



We finish by recalling the argument to put all the pieces together. A technical point: 
unlike in previous treatments we do not assume that the eigenvalues are simple. 

Proof of Theorem 3.9. We first show that for all k we have = liminf A„,fc > A^. Assume 
that A;, < oo. The eigenvalues of if„ are uniformly bounded below by Lemma 3.13, so 
there is a subsequence along which (An,i, . . . , A„,a:) — ^ (G, • • • , ^fc = A^,.). By the same lemma, 
corresponding orthonormal eigenvector sequences have L*-norm uniformly bounded. Pass 
to a further subsequence so that they all converge as in Lemma 3.15, The limit functions 
are orthonormal; by Lemma 3.16 they are eigenfunctions with eigenvalues < A^ and we 
are done. 

We proceed by induction, assuming the conclusion of the theorem up to — 1. For j = 
0, . . . , /c — 1 let Vn,j be orthonormal eigenvectors corresponding to A„j-; for any subsequence 
we can pass to a further subsequence such that Vnj -^l^ fj, eigenfunctions corresponding 
to Aj. Take an orthogonal eigenfunction fk corresponding to A^ and find /| G with 
ll/fc ~ /fcIL < ^- Consider the vector 



The L*-norm of the sum term is uniformly bounded by Ce: indeed, the || v^i j 1 1 uniformly 
bounded by Lemma 3.13, while the coefficients satisfy \{vn,j, fl)\ < \\fl — /fc|| + — fj\\ < 
2e for large n. By the variational characterization in finite dimensions and the uniform L* 
form bound on {-^Hn-) (by Lemma 3.13) together with the uniform bound on HPn/lH^^ (by 



□ 



fe-i 
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Lemma 3.16), we then have 



hmsupA„,fc < hmsup— — — = hmsup —— -———— + o^(l), (3.25) 

\Jn,ki Jn,k) \> nJki ' nJkl 

where Oe(l) — )■ as e — )■ 0. But (3.24) of Lemma 3.16 provides lira {Vnfk: HnVnfk) = 
^(/fc; /fc); so right hand side of (3.25) is 

T^ifk^fk) I ^ _ T-l-{fk,fk) I /.N _ A 

\Jk^Jk) \Jk,Jk) 

Now letting e — )■ 0, we conclude limsup Xn^k < ^k- 

Thus Xn,k Afc; Lemmas 3.13 and 3.15 imply that any subsequence of the Vn,k has a 
further subsequence converging in to some f & L*; Lemma 3.16 then implies that / is an 
eigenfunction corresponding to A^. Finally, convergence is uniform over Wn, W > —b since 
the bound 3.13 is. □ 



4 CLT and tightness for Gaussian and Wishart models 

We now verify Assumptions 1-3 of Section 3 for the band Jacobi forms of Section 2 and thus 
prove Theorems 1.2 and 1.3 via Theorem 3.9, 

We must consider the band forms as (r x r)-block tridiagonal matrices. This amounts to 
reindexing the entries by {k + rj, I + rj), where j G Z+ indexes the blocks and 1 < A;, / < r 
give the index within each block. The scalar processes obtained by fixing k, I can then be 
analyzed jointly; finally, they can be assembled into a matrix-valued process. 

The technical tool we use to establish (3.4) is a functional central limit theorem for con- 
vergence of discrete time processes with independent increments of given mean and variance 
(and controlled fourth moments) to Brownian motion plus a nice drift. Appearing as Corol- 
lary 6.1 in RRV, it is just a tailored version of a much more general result given as Theorem 
7.4.1 in Ethier and Kurtz (1986). We record it here. 

Proposition 4.1. Let a G M and h G C^(]R+), and let Un he a sequence of discrete time real- 
valued processes with ynfl = and independent increments 6yn,j = Unj — ynj-i = ^nVnj ■ 
Assume that rrin — )■ oo and 

rrinESynj = h'{]/mn) + o(l), m„E(5?/„j)^ = + o(l), m„ E(5?/„jy = o(l) 

uniformly for j/nin on compact sets as n ^ oo. Then yn{x) = yn,im„x\ converges in law, 
with respect to the compact-uniform topology, to the process h{x) + ahx where bx is a standard 
Brownian motion. 
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Remark 4.2. Since the limit is a.s. continuous, Skorokhod convergence (the topology used 
in the references) implies uniform convergence on compact intervals (see Theorem 3.10.2 in 
Ethier and Kurtz 1986) and we may as well speak in terms of the latter. 



The Gaussian case 



Take G„ = Gn-o + y/nPn as in(2.4) with Gn-o as in (2.3) and Pn = Pn® ^n-r- We denote 
upper-left r x r blocks with a tilde throughout. Set 



2 



m„ = n 

fn 



As usual, this soft-edge scaling can be predicted as follows. Centering G„ by 2y/n gives, to 
first order, y/n times the discrete Laplacian on blocks of size r. With space scaled down by 
m„, the Laplacian must be scaled up by m,^ to converge to the second derivative. Finally, 
the scaling m„ = n^/^ is determined by convergence of the next order terms to the noise and 
drift parts of the limiting potential. 

Decompose Hn as in (3. 2), (3. 3). The upper-left block is 

Hn=ml + mn{Wn + y„,l;0) = "2^(2 - n^^/^G^.o - Pn)] 

we want the boundary term Wn to absorb the "extra" (the 2 in the right hand side 
"should be" a 1) and the perturbation in order to make l^,i;o small just like the subsequent 
increments of l^,i). We therefore set 

Wn=mn{l-Pn). 

With this choice Assumption 3 is an immediate consequence of the hypotheses of Theorem 
1.2, The processes V'n,i;^n,2 are determined and it remains to verify Assumptions 1 and 2. 

We begin with Assumption 1, identifying the limiting integrated potential Y : IR+ 
Mr{¥) as that of the multivariate stochastic Airy operator: 

Y{x) = V2B^ + Irx^ (4.1) 

where Br^ is a standard Mr{¥) Brownian motion and second term is a scalar matrix. 

Proof of (3.4), Gaussian case. Define scalar processes yk^i for 1 < I < r and I < k < I + r 
by 

yk,i= { ' / ; (4.2) 

<2.-., r + l<k<l + r. 
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(We have dropped the subscript n.) Equivalently, for 1 < /c, / < r, 



kl 



ylk k<i 
yk,i k>l, 



(2^n,2) 



yk+r,l k < I 



kl 







k>L 



(4.3) 



Then we have 



5y, 



k,l;j 



n 



-1/6 



^ 9k+rj k / 

~9k+rj,l+rj I < k < I -\- r 

^ 7^X(n-fc-ri+l)/3) k = l + r. 



(4.4) 



Note that the yk,i are independent increment processes that are mutually independent of one 
another. With the usual embedding j = [n^/'^xj, Proposition 4.1 together with standard 
moment computations for Gaussian and Gamma random variables — in particular 

Exa = v^ + 0(l/v^), E(xa-v^)' = l/2 + 0(l/a), E(xa - v^)' = 0(1), 

for a large (valid since we consider j = 0{n^^^) here) — leads to the convergence of processes 



ykA^j 



k = I 

hk,i{x), I < k < I + r 

\^^h{x) + Irx'^ k = l + r 



where bk, bk are standard real Brownian motions and b^^i are standard F Brownian motions. 
By independence, the convergence occurs jointly over k, I and the limiting Brownian motions 
are all independent. (For the F Brownian motions apply Proposition 4.1 to each of the (3 
real components, which are independent of one another.) Therefore Yn^i are both tight, and 
using (4.3) we have, jointly for 1 < k,l < r, 



yk,k + '2yk+ 



r,k 



{Yn,! + + Yn,2)),^, = { yk,l + V^r^k 

XLk + yk+r,l 



^ {bk + bk) + \rx^ 



h,l + &l"+r-,fc 



k = I 
k > I 
k<l. 



Noting that the two Brownian motions in each entry are independent and that the entries on 
and below the diagonal are independent of each other, we conclude that this limiting matrix 
process is distributed as F (a;) in (4.1). □ 



We turn to Assumption 2. Here we need bounds over the full range < j < [n/r] — 1. 
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Recall that we can extend the Yn^i processes beyond the end of the matrix arbitrarily 
takes care of the truncation), and it is convenient to "continue the pattern" for an extra 
block or two by setting Xq, = for a < 0. For the decomposition (3.5), we simply take rin,i 
to be the expectation of AF„ j and Aun^i to be its centered version; the components of rjn^i 
are then easily estimated and those of Un,i become independent increment martingales. We 
further set Tj{x) = rx. 

Proof of (3.6)-(3.8), Gaussian case. From (4.4) we have ?7n,i;i = and 

{'nn,2;j)k,i = E2mjyk+r,i;j = 2n^/^ (a/ti - E X(„-fc-r(i+i)+i)/3) lfc=/- 
The estimate 

Vi^^ < Ex. = v^^%^7^ < v^- (4.5) 

r(a/2) 

is useful. We obtain 

2\/n \ n 



for some fixed c, which yields the matrix inequalities 

rx — cn~^^'^ < rin,2{x) < 2rx + cn~^^^ 

and verifies (3.6) with fj{x) = rx. Separately, we have the upper bound (3.7): 

r?„,2(x) < 2n2/3 = 2ml. 

The bound (3.8) may be done entry by entry, so we consider the process {{uji^n;j)k,i}jez+ 
for fixed i = 1, 2 and 1 < k,l < r and further omit these indices; for the F-valued processes 
we restrict attention further to one of the /3 real-valued components, and denote the latter 
simply by Un-j- Consider (4.4); the key points are that the increments 6un;j are independent 
and centered, and that scaled up by n^/^ = ml/'^ they have uniformly bounded fourth 
moments. To prove (3.8) it is enough to consider x at integer points and show that the 
random variables 

e-l I _ |2 

sup X sup \^n;'m„x+j ^n;mnx\ 

x=0,l,. ..,n/rm„ j=l,...,m„ 

are tight over n. Squaring, bounding the outer supremum by the corresponding sum, and 
then taking expectations gives 

n/rnin -p i _ 1^ n/rrrin ^f^-p\ 1 4 

^ ^^Vj=l,...,m„ \^n;mnX+j ^n\mnx\ \ ^ iOH/ | CJn;r7i,i (z+1) ^nyirinx] 



^2-2e — 2^ 



— ^^^^^ Q^'^' ' 2£r 



x=0 x=0 
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where we have used the maximum inequahty for martingales (see e.g. Proposition 2.2.16 
of Ethier and Kurtz 1986). To bound the latter expectation, expand the fourth power to 
obtain O(m^) nonzero terms that are 0{m~^) with constants independent of x and n. It 
follows that the entire sum is uniformly bounded over n, as required. □ 

The Wishart case 

1/2 

Take Ln,p = T,n,pLn,pfi with Ln,pfl as in (2.5) and, denoting the upper-left r x r block with a 
tilde, S„ p = S„ p © InAp- Recall that L„ p is ((n + r) A p) x (n Ap). Put Sn^p = Lj^ pL^^p and 
similarly for Sn,p,o] these matrices are {n A p) x (n A p) and the latter is given explicitly in 
(2.6). We sometimes drop the subscripts n,p. Recall (2.7) that ^ - = ^J(S - 1)Lq © 0. 
We set 

m.,p=(-^=^) ' H^.= '^{{V^+V^T-Sn,p). (4.6) 

See Part I for detailed heuristics behind the scaling; written in this way, it allows that 
p, n — )• oo together arbitrarily, i.e. only n A p — t- oo. It is useful to note that 

2-2/^(ra Ap)^/^ < m„,p < {nApY^^. 

Decompose if„p as in (3. 2), (3. 3). The upper-left block is 

2 

H = m^ + m{W + Yi-o) = 2m' - ^ (So-n-p + L|,(S - l)Lo) . 

y/np 

As before we want W to absorb the extra m' and the perturbation in order to make Yi-q 
small. Now the perturbation term is random, but it does not have to be fully absorbed; it is 
enough that Yi-o — )■ in probability. The reason is that the process Yi can absorb an overall 
additive random constant that tends to zero in probability, as is clear in Assumption 1 while 
in Assumption 2 the constant may be put into Ui. Since Lq ^ ^/n, we set 

Wn,p = mn,p (^l - ^/njp(f:n,p - l)^ . (4.7) 

Once again. Assumption 3 follows immediately from the hypotheses of Theorem 1.3, 
We must still deal with the perturbed term in Yi-o and show that 

— (nt - LltLo) (4.8) 
y/np 

in probability. We defer this to the end of the proof of Assumption 1, to which we now turn. 
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As in the Gaussian case, Y is given by (4.1). 

Proof of (3.4), Wishart case. By the preceding paragraph it suffices to treat the null case 
S = / and afterwards check (4.8). Define processes yk,i for 1 < Z < r and / < < / + r by 
(4.2) as in the Gaussian case. From (2.6) with the centering and scaling of (4.6) and (3.3) 
we obtain 



Syk,l;j 



m 



np 



^ i^X(n—k—rj+l)l39k+rj,l+rj 

+X(p-l-r{j+l)+l)l3gt+r{j+l),k+rj) + '^(l) 
^/np — ^X{n-k-rj+l)pX{p-k~rj+l)l3 



I < k < I + r 
k = I + r 



where the 0(1) terms stand in for the interior Gaussian sums of (2.6), all of whose moments 
are bounded uniformly in n,p. Since m}^^ / [npY/"^ < m^~'^^ = o(l) for A; > 1, these terms 
are negligible in the scaling of Proposition 4.1 in the sense that the associated processes 
converge to the zero process. Next, use that expressions of type Xn — are 0(1) in the 
same sense, and that ^/n — \/n — j = Oi^jj ^fn) = Oimj ^fn) = o(l) since we consider j/m 
bounded here (and similarly for p), to write 



m 



np 



(v^(v^ - X{n-k-rj+l)l3) 

+ Vp{VPP - X{p-k-r{j+l)+l)p)) + 0(1) 
-V^gk+rj,l+rj - y/pgt+rU+l),k+rj + 0(1) 
73 {Vp(^^ - X{n-k-rj+l)l3) 

+ Vn{V^ - X(p-fc-rj+l)/3)) + 0(1) 



k = I 

I < k < I + r 
k = I + r 



(4.9) 



It suffices prove tightness and convergence in distribution along a subsequence of any 
given subsequence, and we may therefore assume that p/n — )■ 7^ G [0, 00]. Each case of (4.9) 
contains two terms, and each one of these terms forms an independent increment process 
to which Proposition 4.1 may be applied. (Break the F- valued terms up further into their 
real-valued parts.) Standard moment computations as in the Gaussian case, together with 
independence, then lead to the joint convergence of processes 



VkA^) 



k = I 

I < k <l + r 
k = I + r 
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where bk, bk are standard real Brownian motions and 6^^/ are standard F Brownian motions, 
all independent except that bk+r,i+r and bk,i are identified. Therefore Yn,i are both tight. 
Furthermore, using (4.3) we have 



yk,k + '^yk+r,k 

y*Lk + yk+r,i 



l{bk + bk) + \rx'^ k = l 
h,i + b^+r,k k > I 

b*if^ + bk+r,i k<l 



jointly for 1 < k,l < r. After the dust clears we thus arrive at exactly the same limiting 
process as in the Gaussian case, namely (4.1). 

We now address (4.8). Here we can replace Lq with ^/nIr at the cost of an error that has 
uniformly bounded second and fourth moments. Now (4.7) and the assumed lower bound 
on Wn^p give that S < 1 + l^-pjn for n,p large; this matrix inequality holds entrywise in 
the diagonal basis for S (which was fixed over n,p). One therefore obtains error terms with 
mean square 0{m} jn + rn? /p) = 0{m~^) which is o(l) as required. □ 

Turning to Assumption 2, we may continue the processes Y^^i past the end of the matrix 
for convenience just as in the Gaussian case. The Wishart case presents an additional issue 
at the "end" of the matrix: recall that the final r rows and columns of S in (2.6) may have 
some apparently non-zero terms set to zero. However, these changes are easily absorbed into 
the bounds that follow. For (3.5) we once again take rjn^i to be the expectation of AYn^i and 
Aun^i to be its centered version. We also set f]{x) = rx as before. 

Proof of (3.6)-(3.8), Wishart case. This time we have 

{'nn,i;j)k,i = EmSykj;j = m^{np)~^/'^[2ri - r + l) 1^=/, 

i'nn,2;j)k,l =E2mSyk+r,l-j = 2m^(l - /3"^(np)"^/^ E X(n-fc-ri+l)/3X(p-fc-ri+l)/3) lfc=Z- 

Using (4.5) one finds, for some constant c, that 

m-^{rj + c) < (r7„,i;j + 'nn,2;j)k,k < m~^{2rj + c) 

which yields (3.6) with r/(x) = rx. Separately, we have the upper bound (3.7). The oscilla- 
tion bound (3.8) may be proved exactly as in the Gaussian case: we have once again that 
{ v^^n {^n,i;j ) k,i ^''^^ martiugalcs with independent increments whose fourth moments 

are uniformly bounded. □ 
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5 Alternative characterizations of the laws 



In this section we derive the SDE and PDE characterizations, proving Theorems 1.5 and 1.6, 
First order linear ODE 

For each noise path Br,., the eigenvalue equation 'Hi3,wf = A/ can be rewritten as a first- 
order hnear ODE with continuous coefficients. We begin with the formal second order linear 
differential equation 

f{x) = {x-X + V2B',)f{x) (5.1) 
where / : IR+ with initial condition 

no) = Wf{0). (5.2) 

As usual we allow W G M*{¥) and interpret (5.2) via (3.13) . Rewrite (5.1) in the form 

{f-^Bf)' = ix-X)f-V2Bf'. 

Now let g = f — \/2Bf. The equation becomes 

g' = {x- A)/ - V2Bf' 

= {x-X- 2B^)f - V2Bg. 

In other words, the pair (^f{x),g{x)) formally satisfies the ffist order linear system 

(5.3) 

Since Bq = 0, g simply replaces /' in the initial condition (5.2). If one prefers, this condition 
can be written in the standard form 







V2B 


1 




'f 


_g'_ 




x-X-2B^ 


-V2B 




_g_ 



Wf{0) + igiO) = 



(5.4) 



where W = WiUiu] + J2^>ro ^nd / = J2i<ro ^^^I- 

One could allow general measurable coefficients and define a solution to be a pair of ab- 
solutely continuous functions {f,g) satisfying (5.3) Lebesgue a.e. This definition, equivalent 
to writing (5.3) in an integrated form, is easily seen to coincide with (3.18). As in Remark 
3.5, however, we note the coefficients are continuous; solutions may therefore be taken to 
satisfy (5.3) everywhere and are in fact continuously differentiable. It is classical that the 
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initial value problem has a unique solution which exists for all x G M+ (and further depends 
continuously on the parameter A and the initial condition W). 

Matrix oscillation theory 

The matrix generalization of Sturm oscillation theory goes back to the classic work of Morse 
Morse (1932) (see also Morse 1973). Textbook treatments of self-adjoint differential systems 
include that of Reid (1971). Our reference will be the paper of Baur and Kratz (1989), which 
allows sufficiently general boundary conditions. 

We first consider the eigenvalue problem on a finite interval [0, L] with Dirichlet boundary 
condition f{L) = at the right endpoint. In the scalar- valued setting, the number of 
eigenvalues below A is found to coincide with the number of zeros of / (the solution of the 
initial value problem) that lie in (0,L). The correct generalization to the matrix- valued 
setting involves tracking a matrix whose columns form a basis of solutions, and counting the 
so-called "focal points" . 

We need a little terminology and a few facts from Baur and Kratz (1989), especially 
Definition 1 on p. 338 there and the points that follow. A matrix solution of (5.3) is a 
pair F, G : M.^ — ^ jprxr g^^]^ that each column of is a solution. A conjoined basis 
for (5.3) is a matrix solution {F,G) with the additional properties that F'^G = G'^ F and 
rank^g] = r. The latter properties hold identically on as soon as they do at a single 
point; in particular, we may set -F(O) = / and G(0) = W to obtain a conjoined basis for 
the initial condition (5.4). A point x G IR+ is called a focal point if F{x) is singular, of 
multiplicity nullity F(x). The following proposition summarizes what we need from the 
more general results of Baur and Kratz (1989). 

Proposition 5.1. Consider the differential system 



'f' 




A{x) B{x) 




"/ 


9' 




C{x) - A -At 




9 



(5.5) 



with real parameter X, where A,B,G are n x n matrices depending continuously on x G M 
and B, C are Hermitian. Consider a conjoined basis with some fixed initial condition at 0. 
Consider the associated eigenvalue problem on [0, L] with the same boundary condition at 
and Dirichlet condition at L. Then, for all A G M, the number of focal points of the conjoined 
basis in (0, L) equals the number of eigenvalues below X. Furthermore, if B{x) is nonsingular 
for X G M then the spectrum is purely discrete and bounded below with eigenvalues tending 
to infinity. 

Proof. The idea is that focal points are isolated and move continuously to the left as A 
increases. For sufficiently negative A there are no focal points on (0,L]; each time A passes 
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an eigenvalue, a new focal point is introduced at L. 

We indicate how the proposition follows from the results of Baur and Kratz (1989). Note 
that conditions (Al), (A2) on p. 337 are satisfied by our coefficients, and that (A3) on p. 340 
is satisfied by our boundary conditions. Theorem 1 on p. 345 thus applies. See (3.5) on 
p. 341 for the definition of A(A); the Dirichlet condition at L gives the particularly simple 
result that the right hand side of (4.1) vanishes, so the quantity n2{X) is constant. Theorem 
2 applies as well, and we obtain ni(A) —ni= n^^X). Here ni{\) is the number of focal points 
in [0,L), rii = limA^-oo ''^i(A) and n^^X) is the number of eigenvalues below A. To finish we 
consult Theorem 3 on p. 353; noting that (A4') is satisfied by Section 7.2, p. 365, to find 
that rii is simply the multiplicity of the focal point at 0. The oscillation result follows. For 
the assertion about the spectrum we apply Theorem 4, noting that (A5), p. 358 holds by (i) 
there, and (A6), p. 359 also holds (with Cq = I). □ 

We conclude the following for our matrix system. 

Lemma 5.2. For each A G M, consider the eigenvalue problem (5.3) on [0,L] with boundary 
conditions (5.4) and f{L) = 0. Let {F,G) be the conjoined basis initialized by F{0) = I 
and G{0) = W ; then the number of focal points in the interval (0,L), counting multiplicity, 
equals the number of eigenvalues below X. Furthermore, the spectrum is purely discrete and 
bounded below with eigenvalues tending to infinity. 

A soft argument now recovers an oscillation theorem for the original half-line problem. 

Theorem 5.3. Consider the eigenvalue problem (5. 3), (5. 4) on L^(]R+). For each A G M, let 
{F,G) be the conjoined basis as above; then the number of focal points in {0,oo) equals the 
number of eigenvalues strictly below X. 

Proof. Let A/, jt, A^, A; = 0, 1, . . . denote the lowest eigenvalues of the truncated and half-line 
operators T-Ll, Ti respectively; it suffices to show that limL^oo ^L,k = A^ for each k. Indeed, 
taking L — )■ oo in Lemma 5.2 then yields the conclusion for each A G ]R\{Ao, Ai, . . .}. Letting 
A \ Afc, the right- most focal point must tend to oo by monotonicity and continuity, so the 
claim actually holds for all A G M. 

The variational problem for Tii simply minimizes over the subset of L* functions that 
vanish on [L, oo); the Dirichlet condition is important here. It follows immediately that 
^L,k > Afc, using the min-max formulation of the variational characterization. Proceed by 
induction, assuming that Alj — A^ for j = 0, . . . , — 1. 

Let f^j be orthonormal eigenvectors corresponding to Aij. By the induction hypothesis, 
the variational characterization for and the finite-dimensionality of its eigenspaces, every 
subsequence has a further subsequence such that f^j -^^2 fj, eigenvectors corresponding 
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to Aj. Let fk be an orthogonal eigenvector corresponding to and take compactly 
supported with H/l — /fc||^ < Let 

fc-i 

j=0 

For large L the inner products are at most 2e, so \\gL — /a;|L < ce. Noting that is 
eventually supported on [0,L], the variational characterization gives 

lim sup A < hm sup — — 

L-s>oo ' L^oo {gLiQLI 

and the right-hand side tends to Tii^fk, fk)/ {fk, fk) = A^ as e — )■ 0. □ 



Riccati SDE: stochastic Airy meets Dyson 

Let (F, G) be a conjoined basis for (5.3) as defined in the previous subsection. Then, on any 
interval with no focal points, the matrix Q = GF~^ is self-adjoint and satisfies the matrix 
Riccati equation 

Q' = rx-\-{Q + ^Bf. (5.6) 

(see p. 338 of Baur and Kratz 1989). 

As X passes through a focal point xq, an eigenvalue g of Q "explodes to — oo and restarts 
at +oo". The precise evolution of Q near xq can be seen by choosing a G M so that 
Q = {Q — a)^^ = F{G — aF)^^ is defined; then Q satisfies 

Q' = (1 + Q{V2B + a)) (1 + {^/2B + a)Q) - (x - \)Q\ (5.7) 

Writing q = l/(g — a) and v for the corresponding eigenvector, notice how 

g'(xo) = v{xo)'^Q'{xq)v{xo) = 1. 

Thus q is "pushed up through zero," corresponding to the explosion/restart in g = 1/g + a. 
In this way we may consider Q{x) G M*(F) to be defined for all x. The initial condition is 
then simply Q{0) = W. 

Now let P = F'F'^. While P = Q + ^/2B is not differentiable, by (5.6) it certainly 
satisfies the integral equation 

P,, - P,, = v^(P,, - P.J + / (ry - A - P2) dy 
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if [xi,X2] is free of focal points. In other words, P is a strong solution of the Ito equation 



dP^ = V2dB^ + {rx - X- P^)dx (5.8) 

off the focal points. The evolution of P through a focal point can be described in the 
coordinate P = {P — a)^^ = F{F' — aF)~^. Using (5.7) and Ito's lemma one could write 
down an SDE for P = Q{1 + \/2BQ)^^. The initial condition here is also -P(O) = W. 

Consider the eigenvalues pi, . . . ,pr of P. The main point is that the drift term in (5.8) is 
unitarily equivariant and passes through the usual derivation of Dyson's Brownian motion 
(Dyson 1962). The eigenvalues therefore evolve as an autonomous Markov process. 

To describe the law on paths we need a space, and there are two issues: it will be necessary 
to keep the eigenvalues ordered but also allow for explosions/restarts. We therefore define a 
sequence of Weyl chambers C (—00,00]^' by 

Co = {pi < ■ ■ ■ < pr} 

Cl = {P2 < ■ ■ ■ < Pr < Pi} 

C2 = {ps < ■ ■ ■ < Pr < Pi < P2} 

and so on, permuting cyclically. We glue successive adjacent chambers together at infinity 
in the natural way to make the disjoint union C = Cq U Ci U . . . into a connected smooth 
manifold. That is, taking pi —00 in Cq puts you at pi = +00 in Ci, the smooth structure 
is defined by the coordinate pi = 1/pi, which vanishes along the seam. Glue C^-i to Ck 
similarly along {pkmodr = 00}. We also define Ck, C in which some coordinates may be 
equal, and dC^ = Ck \ Ck, dC = C\C in which some coordinates are equal. 

Theorem 5.4. Represent the eigenvalues of W E M*{¥) as w = {wi . . . ,Wr) G Co- The 
eigenvalues p = {pi, . . . ,pr) of P evolve as an autonomous Markov process whose law on 
paths ]R+ — 7- C zs the unique weak solution of the SDE system 



■jpdbi + j rx - A - + 



dpi = -^dhi + I rx — A — + | dx (5.9) 

with initial condition p(0) = w, where hi, . . . ,hr are independent standard real Brownian 
motions. An eigenvalue Pi can explode to —00 and restart at +00, meaning p crosses from 
Ck to Ck+i; the evolution through an explosion is described in the coordinate pi = 1/pi, which 
satisfies 

dp^ = -j-pPldh + I 1 + (a - rx + ^ ^^^)p1 + |pf ) dx. (5.10) 
Proof. Deriving (5.9) from (5.8) is simply a matter of applying Itb's lemma, at least in C 
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where the eigenvalues are distinct. One needs to differentiate an eigenvalue with respect 
to a matrix, and this information is given by Hadamard's variation formulas. In detail, let 
A G Mr{¥) vary smoothly in time and suppose A{0) has distinct spectrum. Then eigenvalues 
Xi, . . . , Xr of A and corresponding eigenvectors Vi, . . . ,Vr vary smoothly near by the implicit 
function theorem. Differentiating Avi = XiVi and v\vi = 1 leads to the formulas 



A, 



Xi — Xj 



Writing X = A{0) and Vx for the directional derivative, and taking fi(0), . . . ,fr(0) to be 
the standard basis, we find 



VxAj — Xii, V^Aj — 2 — 



Xi — Xj 

Returning to (5.8), at each fixed time x we can change to the diagonal basis for because the 
noise term is invariant in distribution and the drift term is equivariant. Ito's lemma amounts 
to formally writing dpi = VdPVi + \^'dpPi ^-^d using that dBa are jointly distributed as 
\/2/ 13 dhi for z = 1 . . . , r while \dBij^ = dt for j ^ i. We thus arrive at (5.9). 

Recall that the evolution of P through a focal point is still described by an SDE, after 
changing coordinates. The same is therefore true of p through an explosion; the form (5.10) 
is obtained from (5.9) by an application of Ito's lemma. 

Just as with the usual Dyson's Brownian motion, the Pi are almost surely distinct at all 
positive times: p(x) G C for all x > 0. One can show this "no collision property" holds for 
any solution of (5. 9), (5. 10), even with an initial condition p(0) G dCo. (Technically, one 
defines an entrance law from dC by a limiting procedure.) Since the coefficients are regular 
inside C, this suffices to prove uniqueness of the law. See Anderson et al. (2009), Section 
4.3.1 for a detailed proof in the driftless case. □ 

Proof of Theorem 1.5. Explosions of p as in Theorem 5.4 correspond to focal points of F 
for each A. By Theorem 5.3, the total number of explosions K is equal to the number of 
eigenvalues strictly below A. (Notice that p ends up in Ck-) For a fixed A, translation 
invariance of the driving Brownian motions bi allows one to shift time x ^-> x — A/r and 
use (1.3) started at xq = —X/r. Putting a = —X we have P(— < a) = P(Afc > A) = 
Pa/r,v/{K < k) as required. □ 
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PDE and boundary value problem 

We now prove the PDE characterization, Theorem 1.6, We will need two properties of the 
eigenvalue diffusion. 

Lemma 5.5. Let p : [xq,oo) — t- C have law Pxo,w 'in- (1-3) and let K he the number of 
explosions. Then the following hold: 

(i) Given xq, k, Pxo,vf{K < k) is increasing in w with respect to the partial order w < w' 
given by Wi < w[, i = 1, . . . ,r . 

(a) Pxo,w-almost surely, pi, . . . ,pr remain bounded below in Ck (after the last explosion) , 
or equivalently in Cq on the event {K = 0}. 

Proof. Part (i) is a consequence Theorem 1.5 and Remark 1.1, the pathwise monotonicity 
of the eigenvalues as a function of the boundary parameter W with respect to the usual 
matrix partial order. It can also be seen from the related fact that the matrix partial order 
is preserved pathwise by the matrix Riccati equation (5.8), which implies that a solution 
started from W explodes no later than one started from W > W. This fact holds for the 
P evolution if it holds for the Q evolution (5.6), and for the latter it is Theorem IV.4.1 of 
Reid (1972). 

Part (ii) follows from the stronger assertion that pi ~ ^/rx as a; — > oo. In the r = 1 case 
this is Proposition 3.7 of RRV, Heuristically, the single particle drift linearizes at the stable 
equilibrium ^/rx to 2^fTx[^\frx — pi)] even with the repulsion terms one expects fluctuations 
of variance only C / ^Jx. We omit the proof. □ 

Proof of Theorem 1.6. Assume the diffusion representation of Theorem 1.5 for Fg(x;w) = 
P(~Ao < x) on M X Cq. We first show F = F/s has the asserted properties and afterwards 
argue uniqueness. Writing L for the space-time generator of (1.3), the PDE (1.6) is simply 
the equation LF = after replacing x with x/r. In other words, it is the Kolmogorov 
backward equation for the hitting probability (1.4) (more precisely, the probability of never 
hitting {wi = — oo}), which is L-harmonic. This extends to Wr = +oo by using the local 
coordinate there; from (5.10) one sees that the coefficients remain regular. Although the 
diffusivity vanishes at Wr = +oo, the drift does not, and it follows that F is continuous up to 
Wr = +00. The PDE holds even at points w G OCq with appropriate one-sided derivatives; 
notice that the apparent singularity in the "Dyson term" of the PDE is in fact removable 
for F regular and symmetric in the Wi. (For a toy version, consider a function / : M — t- M 
that is twice different iable and even; then /' is odd and f'{w)/w is continuous with value 
/"(O) aX w = 0. These functions form the domain of the generator of the Bessel process 
on the half-line {w > 0} in the same way that symmetric functions form the domain of 
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the generator of Dyson's Brownian motion on a Weyl chamber.) Finally, the picture can be 
copied to w G (—00,00]^ by symmetry, permuting the Wi. 

The boundary condition (1.7) follows from the monotonicity property of Lemma 5.5 (i). 
For fixed w, F{x\ w) — t- 1 as x — t- 00 because it is a distribution function in x; by mono- 
tonicity in w, the convergence is uniform over a set of w bounded below. To understand the 
boundary condition (1.8) (using Wi in Cq), change to the coordinate wi = l/wi and close the 
domain to include the "bottom boundary" {wi = —00}. Then (1.8) becomes an ordinary 
Dirichlet condition. While the diffusivity vanishes on this boundary, the drift is nonzero into 
the boundary. The hitting probability is therefore continuous up to the boundary. 

For F'' there is the following more general picture. Consider the PDE in Co U ■ • ■ U C^, 
defined across the seams by changing coordinates as in (5.10). Put the boundary condition 
(1.7) on all the chambers and (1.8) on the bottom of Ck- Then the solution is in Cq; 
the reason is the same as for F = F^, but now using (1.5) and the hitting event "at most k 
explosions". Similarly, the solution is in Ci and so on down to F^ in Ck- Continuity 
holds across the seams and (1.9) follows after permuting coordinates. 

Toward uniqueness, suppose F is another bounded solution of the boundary value prob- 
lem (1.6)-(1.8) on M X Cq. With the notation of Theorem 1.5, F{rx; p^) is a local martingale 
under Pxq.w by the PDE (1.6). It is therefore a bounded martingale. Let ( G (xo, 00] be the 
time of the first explosion; optional stopping gives F(rxo; w) = F[r(( A x);p^Ax) for 

all X > Xq. Taking x — > 00, we conclude by bounded convergence, the boundary behavior 
(1.7), (1.8) of F and Lemma 5.5 (ii) that F(rxo,w) = Pa;o,w(C = 00). By Theorem 1.5, this 
probability is F^(rxo, w). One argues similarly for the higher eigenvalues. □ 

6 Connection with Painleve II 

In Part I we used the PDE characterization to give new proofs of certain Painleve II formulas 
for the single-parameter (rank one deformed) distribution functions -F/3(x; w) in the cases /3 = 
2, 4, in particular recovering the Painleve II representations for the corresponding undeformed 
Tracy- Widom distributions by taking w 00. The Painleve formulas appeared originally 
in Baik and Rains (2000, 2001) in a different context; in the random matrix theory setting, 
Baik (2006) derived them from the BBP result in the case /3 = 2 but they are new for /3 = 4 
when w ^ (see Wang 2008). 

Baik (2006) also derives a Painleve II formula for the multi-parameter distribution func- 
tion F2{x;wi, . . . , Wr). While we do not have an independent proof at present, we used the 
computer algebra system Maple to verify symbolically that it does indeed satisfy our PDE 
(1.6) at /3 = 2 for r = 2,3,4,5. (Of course one would hope for a pencil-and-paper proof for 
all r. It would then remain to verify boundedness and the boundary conditions (1.7), (1.8) 
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independently.) We first state Baik's formula and then briefly describe the symbolic compu- 
tation. 

Let u{x) be the Hastings-McLeod solution of the homogeneous Painleve II equation 

u" = 2u^ + xu, (6.1) 

characterized by 

u{x) ~ Ai(a;) as x — )■ +00 
where Ai(x) is the Airy function. Put 

vix) = l-u\ (6.2) 
E{x) = exp{- u), F{x) = exp{- v). (6.3) 

Next define two functions f{x,w), g{x,w) on M^, analytic in w for each fixed x, by the first 
order linear ODEs 

\9 j \ —wu + u' w"^ — X — j \g 



(6.4) 



and the initial conditions 

/(x,0) = E{x) = g{x,Q). 

Equation (6.4) is one member of the Lax pair for the Painleve II equation. The other member 
of the pair is 

dx\gj \u{x) ~w \g 



(6.5) 



which holds for each fixed ti; G M. The consistency condition for the overdetermined system 
(6. 4), (6. 5) (i.e. that the partials commute) is the Painleve II equation (6.1). The functions 
/, g can also be defined in terms of an associated Riemann-Hilbert problem (see e.g. Baik 
2006). 

Baik's formula is 

det( {w, + £Y''f{x,Wi)) 
F,{x;w,,..., Wr) = F{x) (6.6) 

Our symbolic verification for small values of r consisted of the following steps. The 
differential relations given by (6.1)-(6.5) were encoded as formal substitution rules. The 
determinant in (6.6) was expanded (this step becomes problematic for larger r. . . !) and the 
result plugged into our PDE (1.6). The substitution rules were then applied repeatedly. 
Finally, the result was factored using Maple's built-in command. Each time, the output 
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contained the factor 

V + U — (U) + XM , 

which vanishes identically: differentiate and apply (6.1) to see it is constant, and take a; — )■ oo 
to see the constant is zero. 
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